


Quantum Fourier Transform Based Kernel for Solar Irradiance Forecasting

Mechiche-Alami, Nawfel, Rodriguez, Eduardo, Cardemil, Jose M., Droguett, Enrique Lopez

arXiv.org Machine Learning

This study proposes a Quantum Fourier Transform (QFT)-enhanced quantum kernel for short-term time-series forecasting. Exogenous predictors are incorporated by convexly fusing feature-specific kernels. For both quantum and classical models, the only tuned quantities are the feature-mixing weights and the KRR ridge α; classical hyperparameters (γ, r, d) are fixed, with the same validation set size for all models. Experiments are conducted on a noiseless simulator (5 qubits; window length L=32). Limitations and ablations are discussed, and paths toward NISQ execution are outlined. Introduction Quantum Machine Learning (QML) is an emerging discipline that combines the principles of quantum physics with traditional machine learning (ML) to exploit the distinctive characteristics of quantum systems, including superposition and entanglement phenomena [1]. This distinction facilitates the expeditious execution of certain tasks [2], such as classification and dimensionality reduction, where QML has demonstrated significant acceleration [3]. QML applications have extended to time-series data, leveraging quantum phenomena to model complex temporal dependencies. The goal is to enhance the results of traditional tasks by performing computations on qubits, which can process data more efficiently than classical bits [4, 5]. For example, Thakkar et al. [6] demonstrated that quantum machine-learning methods could enhance financial forecasting by improving both churn prediction and credit-risk assessment. Likewise, Kea et al. [7] developed a hybrid quantum-classical Long Short-Term Memory (QLSTM) to improve stock-price forecasting by leveraging quantum data encoding and high-dimensional quantum representations.
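The fused-kernel regression described in the abstract can be sketched classically. The snippet below is a minimal illustration, not the paper's code: the toy RBF kernels and function names are assumptions standing in for the QFT-based feature-specific kernels, but the convex fusion of kernel matrices and the KRR ridge α match the setup described above.

```python
import numpy as np

def fuse_kernels(kernels, weights):
    # Convex combination of feature-specific kernel matrices:
    # weights are non-negative and sum to one (the tuned mixing weights).
    w = np.asarray(weights, dtype=float)
    assert np.all(w >= 0) and np.isclose(w.sum(), 1.0)
    return sum(wi * K for wi, K in zip(w, kernels))

def krr_fit_predict(K_train, y_train, K_test, alpha=1.0):
    # Kernel ridge regression: solve (K + alpha I) c = y, predict K_test @ c.
    n = K_train.shape[0]
    coef = np.linalg.solve(K_train + alpha * np.eye(n), y_train)
    return K_test @ coef

# Toy example: two feature-specific RBF kernels on 1-D exogenous features.
rng = np.random.default_rng(0)
X1 = rng.normal(size=(20, 1))
X2 = rng.normal(size=(20, 1))
y = np.sin(X1[:, 0]) + 0.1 * X2[:, 0]
rbf = lambda A, B: np.exp(-np.square(A - B.T))
K = fuse_kernels([rbf(X1, X1), rbf(X2, X2)], [0.7, 0.3])
pred = krr_fit_predict(K, y, K, alpha=0.1)  # in-sample prediction
```

In practice the mixing weights and α would be chosen on the validation set, as the abstract notes.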


Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages

Omnilingual ASR team, null, Keren, Gil, Kozhevnikov, Artyom, Meng, Yen, Ropers, Christophe, Setzler, Matthew, Wang, Skyler, Adebara, Ife, Auli, Michael, Balioglu, Can, Chan, Kevin, Cheng, Chierh, Chuang, Joe, Droof, Caley, Duppenthaler, Mark, Duquenne, Paul-Ambroise, Erben, Alexander, Gao, Cynthia, Gonzalez, Gabriel Mejia, Lyu, Kehan, Miglani, Sagar, Pratap, Vineel, Sadagopan, Kaushik Ram, Saleem, Safiyyah, Turkatenko, Arina, Ventayol-Boada, Albert, Yong, Zheng-Xin, Chung, Yu-An, Maillard, Jean, Moritz, Rashel, Mourachko, Alexandre, Williamson, Mary, Yates, Shireen

arXiv.org Artificial Intelligence

Automatic speech recognition (ASR) has advanced in high-resource languages, but most of the world's 7,000+ languages remain unsupported, leaving thousands of long-tail languages behind. Expanding ASR coverage has been costly and limited by architectures that restrict language support, making extension inaccessible to most--all while entangled with ethical concerns when pursued without community collaboration. To transcend these limitations, we introduce Omnilingual ASR, the first large-scale ASR system designed for extensibility. Omnilingual ASR enables communities to introduce unserved languages with only a handful of data samples. It scales self-supervised pre-training to 7B parameters to learn robust speech representations and introduces an encoder-decoder architecture designed for zero-shot generalization, leveraging a LLM-inspired decoder. This capability is grounded in a massive and diverse training corpus; by combining breadth of coverage with linguistic variety, the model learns representations robust enough to adapt to unseen languages. Incorporating public resources with community-sourced recordings gathered through compensated local partnerships, Omnilingual ASR expands coverage to over 1,600 languages, the largest such effort to date--including over 500 never before served by ASR. Automatic evaluations show substantial gains over prior systems, especially in low-resource conditions, and strong generalization. We release Omnilingual ASR as a family of models, from 300M variants for low-power devices to 7B for maximum accuracy. We reflect on the ethical considerations shaping this design and conclude by discussing its societal impact. In particular, we highlight how open-sourcing models and tools can lower barriers for researchers and communities, inviting new forms of participation. Open-source artifacts are available at https://github.com/facebookresearch/omnilingual-asr.


To unearth their past, Amazonian people turn to 'a language white men understand'

Science

The site, a few kilometers from her own hut in Ipatsé, a Kuikuro village in the Xingu Indigenous territory, was once the backyard of her great-grandparents' house. As she scrapes the brown earth with a trowel, she soon spots a black ceramic shard. It is only about the size of her palm, and this is her first day ever on an archaeological excavation. But she immediately recognizes what the object once was. "It's an alato," she says, showing the piece to a group of archaeologists and other Kuikuro who have gathered to watch the excavation in the village of Anitahagu. An alato, Yamána explains, is a large pan used to cook beiju, a white flatbread made with yucca flour that's eaten almost every day in her village. Her grandmother still has one in the backyard fire pit where she prepares most meals, just as countless Kuikuro women did before her. This alato likely belonged to her great-grandmother on her mother's side.


Human-Level Reasoning: A Comparative Study of Large Language Models on Logical and Abstract Reasoning

Moreira, Benjamin Grando

arXiv.org Artificial Intelligence

Evaluating reasoning ability in Large Language Models (LLMs) is important for advancing artificial intelligence, as it transcends mere linguistic task performance. It involves determining whether these models genuinely understand information, perform inferences, and draw conclusions in a logically valid way. This study compares the logical and abstract reasoning skills of several LLMs - including GPT, Claude, DeepSeek, Gemini, Grok, Llama, Mistral, Perplexity, and Sabiá - using a set of eight custom-designed reasoning questions. The LLM results are benchmarked against human performance on the same tasks, revealing significant differences and indicating areas where LLMs struggle with deduction.


Comprehending Spatio-temporal Data via Cinematic Storytelling using Large Language Models

Shang, Shuo, Kalnis, Panos, Jensen, Christian S.

arXiv.org Artificial Intelligence

Spatio-temporal data captures complex dynamics across both space and time, yet traditional visualizations are complex, require domain expertise, and often fail to resonate with broader audiences. Here, we propose MapMuse, a storytelling-based framework for interpreting spatio-temporal datasets, transforming them into compelling, narrative-driven experiences. We utilize large language models and employ retrieval augmented generation (RAG) and agent-based techniques to generate comprehensive stories. Drawing on principles common in cinematic storytelling, we emphasize clarity, emotional connection, and audience-centric design. As a case study, we analyze a dataset of taxi trajectories. Two perspectives are presented: a captivating story based on a heat map that visualizes millions of taxi trip endpoints to uncover urban mobility patterns; and a detailed narrative following a single long taxi journey, enriched with city landmarks and temporal shifts. By portraying locations as characters and movement as plot, we argue that data storytelling drives insight, engagement, and action from spatio-temporal information. The case study illustrates how MapMuse can bridge the gap between data complexity and human understanding. The aim of this short paper is to provide a glimpse of the potential of the cinematic storytelling technique as an effective communication tool for spatio-temporal data, as well as to describe open problems and opportunities for future research.


Benchmarking noisy label detection methods

Pickler, Henrique, Kamassury, Jorge K. S., Silva, Danilo

arXiv.org Machine Learning

Label noise is a common problem in real-world datasets, affecting both model training and validation. Clean data are essential for achieving strong performance and ensuring reliable evaluation. While various techniques have been proposed to detect noisy labels, there is no clear consensus on optimal approaches. We perform a comprehensive benchmark of detection methods by decomposing them into three fundamental components: label agreement function, aggregation method, and information gathering approach (in-sample vs out-of-sample). This decomposition can be applied to many existing detection methods, and enables systematic comparison across diverse approaches. To fairly compare methods, we propose a unified benchmark task, detecting a fraction of training samples equal to the dataset's noise rate. We also introduce a novel metric: the false negative rate at this fixed operating point. We identify that in-sample information gathering using average probability aggregation combined with the logit margin as the label agreement function achieves the best results across most scenarios. Our findings provide practical guidance for designing new detection methods and selecting techniques for specific applications. Keywords: Noisy label detection, Noisy labels, Dataset cleaning, Data quality, Benchmark, Neural networks 1. Introduction Most supervised learning methods assume a perfectly labeled dataset. However, training data often contain incorrectly labeled instances. Even large, standard benchmark datasets, such as CIFAR, ImageNet, and MS-COCO, are known to have noisy labels [1, 2].
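The three-component decomposition can be illustrated with a minimal sketch (all names hypothetical, not the authors' code): the logit margin as the label agreement function, flagging a fraction of samples equal to the noise rate, and the false negative rate at that fixed operating point. A full pipeline would also aggregate scores, e.g. averaging probabilities across training epochs.

```python
import numpy as np

def logit_margin(logits, labels):
    # Label agreement function: margin between the given label's logit
    # and the largest competing logit (low margin -> likely noisy label).
    idx = np.arange(len(labels))
    given = logits[idx, labels]
    masked = logits.copy()
    masked[idx, labels] = -np.inf
    return given - masked.max(axis=1)

def detect_noisy(scores, noise_rate):
    # Unified benchmark task: flag the fraction of samples with the
    # lowest agreement scores equal to the dataset's noise rate.
    k = int(round(noise_rate * len(scores)))
    mask = np.zeros(len(scores), dtype=bool)
    mask[np.argsort(scores)[:k]] = True
    return mask

def false_negative_rate(flagged, is_noisy):
    # Metric at the fixed operating point: noisy samples left unflagged.
    return np.sum(~flagged & is_noisy) / max(np.sum(is_noisy), 1)
```

On a toy batch where two of four labels contradict the model's logits, the two low-margin samples are flagged and the FNR is zero.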


Man-Made Heuristics Are Dead. Long Live Code Generators!

Dwivedula, Rohit, Saxena, Divyanshu, Akella, Aditya, Chaudhuri, Swarat, Kim, Daehyeok

arXiv.org Artificial Intelligence

Policy design for various systems controllers has conventionally been a manual process, with domain experts carefully tailoring heuristics for the specific instance in which the policy will be deployed. In this paper, we re-imagine policy design via a novel automated search technique fueled by recent advances in generative models, specifically Large Language Model (LLM)-driven code generation. We outline the design and implementation of PolicySmith, a framework that applies LLMs to synthesize instance-optimal heuristics. We apply PolicySmith to two long-standing systems policies - web caching and congestion control, highlighting the opportunities unraveled by this LLM-driven heuristic search. For caching, PolicySmith discovers heuristics that outperform established baselines on standard open-source traces. For congestion control, we show that PolicySmith can generate safe policies that integrate directly into the Linux kernel.
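The generate-and-test loop implied by such LLM-driven heuristic search can be sketched as follows. Everything here is a stand-in: `llm_mutate` is a stub where a real system would prompt a code-generation model, and `evaluate` a stub for a trace-driven simulator; neither reflects PolicySmith's actual interfaces.

```python
import random

def llm_mutate(policy_src):
    # Placeholder for an LLM call that rewrites the candidate heuristic.
    return policy_src + "  # mutated"

def evaluate(policy_src, traces):
    # Placeholder fitness: a real system would compile the heuristic and
    # score it on workload traces (e.g. cache hit rate, throughput).
    return random.random()

def policy_search(seed_policy, traces, rounds=5, beam=3):
    # Beam-style generate-and-test: mutate the current pool via the LLM,
    # keep the top-scoring candidates each round, return the best.
    pool = [seed_policy]
    for _ in range(rounds):
        candidates = pool + [llm_mutate(p) for p in pool]
        candidates.sort(key=lambda p: evaluate(p, traces), reverse=True)
        pool = candidates[:beam]
    return pool[0]
```

The design choice mirrored here is that the LLM only proposes code; selection is driven entirely by measured performance on the deployment instance's traces.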



CNN-TFT explained by SHAP with multi-head attention weights for time series forecasting

Stefenon, Stefano F., Matos-Carvalho, João P., Leithardt, Valderi R. Q., Yow, Kin-Choong

arXiv.org Artificial Intelligence

Convolutional neural networks (CNNs) and transformer architectures offer strengths for modeling temporal data: CNNs excel at capturing local patterns and translational invariances, while transformers effectively model long-range dependencies via self-attention. This paper proposes a hybrid architecture integrating convolutional feature extraction with a temporal fusion transformer (TFT) backbone to enhance multivariate time series forecasting. The CNN module first applies a hierarchy of one-dimensional convolutional layers to distill salient local patterns from raw input sequences, reducing noise and dimensionality. The resulting feature maps are then fed into the TFT, which applies multi-head attention to capture both short- and long-term dependencies and to weigh relevant covariates adaptively. We evaluate the CNN-TFT on a hydroelectric natural flow time series dataset. Experimental results demonstrate that CNN-TFT outperforms well-established deep learning models, with a mean absolute percentage error of up to 2.2%. The explainability of the model is obtained by a proposed Shapley additive explanations with multi-head attention weights (SHAP-MHAW). Our novel architecture, named CNN-TFT-SHAP-MHAW, is promising for applications requiring high-fidelity, multivariate time series forecasts, being available for future analysis at https://github.com/SFStefenon/CNN-TFT-SHAP-MHAW .
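A stripped-down view of the hybrid pipeline, assuming nothing beyond the abstract (NumPy stand-ins, not the authors' model): one-dimensional convolutions distill local features from the raw multivariate sequence, then multi-head self-attention weighs dependencies across time steps.

```python
import numpy as np

def conv1d_features(x, kernels):
    # CNN stage: slide each 1-D kernel over the (time, channels) input
    # and stack the resulting feature maps (valid padding).
    T = x.shape[0]
    maps = []
    for k in kernels:  # each kernel has shape (width, channels)
        w = k.shape[0]
        maps.append([np.sum(x[t:t + w] * k) for t in range(T - w + 1)])
    L = min(len(m) for m in maps)
    return np.array([m[:L] for m in maps]).T  # (time', n_kernels)

def multi_head_attention(x, n_heads=2):
    # Simplified self-attention: split channels into heads, apply scaled
    # dot-product attention per head, concatenate the head outputs.
    T, D = x.shape
    d = D // n_heads
    outs = []
    for h in range(n_heads):
        q = k = v = x[:, h * d:(h + 1) * d]
        scores = q @ k.T / np.sqrt(d)
        w = np.exp(scores - scores.max(axis=1, keepdims=True))
        w /= w.sum(axis=1, keepdims=True)
        outs.append(w @ v)
    return np.concatenate(outs, axis=1)

# Toy multivariate series: 32 time steps, 3 covariates, 4 conv kernels.
rng = np.random.default_rng(1)
x = rng.normal(size=(32, 3))
kernels = [rng.normal(size=(3, 3)) for _ in range(4)]
feats = conv1d_features(x, kernels)   # (30, 4) local-pattern features
ctx = multi_head_attention(feats)     # (30, 4) attention-weighted context
```

The real TFT additionally uses learned projections, gating, and static covariate encoders; this sketch only shows the conv-then-attention data flow.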